Universal quantum simulation with pre-threshold superconducting qubits: 

Single-excitation subspace method 
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We propose a method for general-purpose quantum computation and simulation that is well 
suited for today's pre-threshold-fidelity superconducting qubits. This approach makes use of the 
n-dimensional single-excitation subspace (SES) of a system of n tunably coupled qubits. It can be 
viewed as a nonscalable special case of the standard gate-based quantum computing model, but 
allows many operations in the unitary group SU(n) to be implemented by a single application of the 
Hamiltonian. Our approach bypasses the need to decompose the evolution operator into elementary 
gates, making large, nontrivial computations possible without error correction. The method is 
especially well suited for universal quantum simulation, specifically simulation of the Schrodinger 
equation with a real but otherwise arbitrary nxn Hamiltonian. We argue that a 1000-qubit SES 
processor, which would require no known improvements in superconducting device technology and 
which could be built today, should be capable of achieving quantum speedup relative to a petaflop 
supercomputer. We speculate on the utility and practicality of such a universal quantum simulator. 

PACS numbers: 03.67.Lx, 85.25.Cp 



I. INTRODUCTION AND MOTIVATION 
A. The quest for quantum speedup 

In the standard gate-based model of quantum compu- 
tation with n qubits, the initial state of the (closed) sys- 
tem is represented by a 2™-dimensional complex vector 
if), and the computation is described by unitary time- 
evolution operator U. Running the quantum computer 
implements the map 



ip -> Utp. 



(1) 



A key feature here is the exponential classical informa- 
tion storage capacity of the wave function ip. However, 
the number of one- and two-qubit gates required to imple- 
ment an arbitrary element of the unitary group SU(2") is 
at least 2 2 ™ — 1, making the construction of an arbitrary 
U inefficient (using elementary gates) . The primary goal 
of quantum algorithm design is to implement interesting 
cases of U with a polynomial number of elementary gates 
(it is also necessary that the desired answer be obtained 
with high probability upon measurement, possibly after 
a polynomial number of repetitions) . The class of prob- 
lems that can be solved with success probability > i by 
a quantum circuit with depth bounded by a polynomial 
in the length of the input is called BQP. Two important 
algorithms in the BQP complexity class are the quan- 
tum algorithms for order finding [l| and weak simulation 
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of local (or sparse) Hamiltonians 0-0] ■ Although these 
algorithms are technically efficient, the number of gates 
required for interesting applications is still large [§-fll| 
and an error-corrected quantum computer is therefore 
required. 

A major milestone for the field of quantum computa- 
tion will be to achieve quantum speedup relative to the 
most powerful classical supercomputer (itself a rapidly 
moving target). Here we assume a general-purpose 
or universal quantum computer, in contrast with ana- 
log quantum simulators and other special purpose ap- 
proaches where the simulated Hamiltonian or problem 
type is restricted to a certain family. Analog quantum 
simulation (l2l - [l8j is already a highly developed and suc- 
cessful approach to quantum computation, and only 50 
or so degrees-of-freedom become classically intractable 
[l9j (fewer for analog open-system simulators pol ] , where 
decoherence is considered part of the problem to be sim- 
ulated). In fact, the break-even point, where the classical 
and quantum computation times are equal, has probably 
already been achieved with analog simulators. We will 
not attempt to identify these examples here, which would 
require a sharp degree-of-controllability criterion to dis- 
tinguish a "proper" analog quantum simulation from a 
more traditional experiment, which also has a number 
of control parameters and which can rarely be simulated 
classically. Currently, the best performing approach to 
universal fault-tolerant quantum computation (in terms 
of threshold value and hardware scalability) appears to 
be the surface code [2ll - [2"4j . Conservative estimates of the 
number n of physical qubits required for break-even with 
a 99.9% fidelity (superconducting) surface code quan- 
tum computer range from 10 5 for quantum simulation to 
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10 7 for factoring. Even running a long, error-corrected 
computation with a only a few logical qubits appears 
to require more than a thousand physical qubits. Thus, 
the break-even point for fault-tolerant universal quantum 
computation is likely to take some time to achieve. 



B. Pre-threshold quantum computation 

The single-excitation subspace (SES) method de- 
scribed here is not intended to replace the standard 
model of scalable, fault-tolerant quantum computation 
outlined above. Rather, it provides an alternative ap- 
proach to general-purpose quantum computation that 
can be implemented with today's pre-threshold hard- 
ware, especially superconducting circuits. We use the 
term "pre-threshold" to refer to qubits and architectures 
with insufficient fidelity to enable fault-tolerant quantum 
computation. State-of-the-art demonstrations of quan- 
tum algorithms with pre-threshold qubits are necessar- 
ily restricted to no more than about 10 qubits [25| - |36| . 
With superconducting circuits, this restriction is not a 
consequence of a limitation of fabrication (it is already 
possible to make devices with hundreds of qubits), but 
instead that the available coherence time inhibits their 
application to larger problem sizes, which would require 
larger circuit depth and longer computation times. The 
SES method bypasses the need to use one- and two-qubit 
gates, allowing larger computations to be carried out 
within the available coherence time. 



TABLE I. Three approaches to pre-threshold quantum com- 
putation and simulation. The left column lists attributes 
achievable by a universal, fault-tolerant quantum computer. 
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Table U compares three broad approaches to quantum 
computation with pre-threshold-fidelity qubits. "Small 
system" refers to gate-based computations with a few 
qubits, which have been used to test fundamental con- 
cepts of quantum information processing, demonstrate 
hardware functionality, and assess qubit and gate per- 
formance. The SES method is also general purpose, but 
enables quantum speedup; however neither approach is 
scalable. Analog quantum simulation and other scalable, 
special-purpose approaches trade universality for a faster 
route to speedup. We are not aware of any pre-threshold 
approaches that would allow computations of arbitrary 
accuracy or runtime. 



II. QUANTUM COMPUTATION IN THE SES 

A. Hardware model 

Consider the following model of an array of n coupled 
superconducting qubits, 

i ii' 

written in the basis of uncoupled-qubit eigenstates. Here 
i, i' = 1, 2, . . . , n, and 

The 6i are qubit transition energies and gw are qubit- 
qubit interaction strengths; both are assumed to be tun- 
able. gu> is a real, symmetric matrix with vanishing diag- 
onal elements. We also require microwave pulse control 
of at least one qubit, and simultaneous readout (projec- 
tive measurement of <j z ) of every qubit. Model © de- 
scribes a fully connected network of qubits, which we 
refer to as a fully connected quantum computer (FQC). 
The FQC model should be contrasted with local quantum 
computer (LQC) models that have coupling only between 
nearby qubits (nearest neighbors, for example). The SES 
method can be applied to a wide variety qubit-qubit in- 
teraction types (see appendix) , but without loss of gener- 
ality we restrict ourselves to the simple a x ® a x coupling 
of ([2]). Although we assume an architecture based on su- 
perconducting qubits, our approach might apply to other 
architectures as well. 

The FQC model © is usually considered to be un- 
scalable, because of the 0{n 2 ) tunable coupling circuits 
and wires, a position that we also adopt here [37j]- In 
gate-based universal quantum computation, the FQC 
and LQC models are actually equivalent in the sense 
that any quantum circuit implemented by a FQC can 
be implemented by a LQC after adding chains of SWAP 
gates, which only introduce polynomial overhead. How- 
ever, this equivalence is restricted to the standard gate- 
based approach and does not apply here. 



B. Device parameters 

Superconducting qubits have been reviewed in 
Refs. [H-[4lJ|. Although the model © can be realized 
with several qubit designs, the transmon qubit [42j cur- 
rently has the best performance. (We note that frequency 
tunability is required here, which will increase sensitiv- 
ity to flux noise). For concreteness we assume a qubit 
frequency e/27r in the range of 5 - 6 GHz and coupling 
strength .g/27r in the range -100 to 100 MHz. 
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FIG. 1. Superconducting phase qubit and tunable inductive 
coupler circuits. 




FIG. 2. (color online) SES processor layout. 



C. Processor layout 

A fully connected quantum computer of n qubits oper- 
ating in the SES mode requires n(n— l)/2 coupler circuits 
and associated wires. A variety of tunable couplers can 
be used for this purpose [4a - l48| . For definiteness, we 
consider the tunable inductive coupler [48| demonstrated 
with superconducting phase qubits and show a processor 
layout that avoids excessive crossovers. 

Circuit diagrams for a single phase qubit "q" and single 
coupler "c" are illustrated in Fig. [TJ where the crossed 
boxes represent Josephson junctions. In terms of these 
elements, a possible layout for a fully connected n = 5 
array is shown in Fig. [2j If we assume a qubit size of 
10/im x 10/im and a lOnm spacing of the coupler wires 
(blue vertical lines in Fig. [5]), then a 1000-qubit processor 
would require about a 1cm x 1cm chip. 



D. Single-excitation subspace 

The idea we explore here is to perform a quantum com- 
putation in the n-dimensional single-excitation subspace 
of the full 2™-dimensional Hilbert space. This is the sub- 
space spanned by the computational basis states 

|0 = 4|00---0) = |0 • - • li - - - 0>, (4) 

with i = 1, 2, . . . , n. Experimentally, it is possible to pre- 
pare the quantum computer in the SES, and it will re- 
main there with high probability if the following condi- 
tions are satisfied: 

1. The coupling strengths \gu>\ are much smaller than 
the €i, which is usually well satisfied in supercon- 
ducting architectures. (This condition is the same 
as that for the applicability of the rotating-wave 
approximation. ) 

2. Singlc-qubit operations such as tt and 7r/2 pulses 
are not used during the computation. However, 2-7T 
pulses are permitted and turn out to be extremely 
useful (these can be implemented as z rotations, 
which do not require microwaves), n pulses are of 
course used to prepare SES states from the system 
ground state |00 • • • 0). 

3. The quantum computation time < qu is much less 
than the single-qubit energy relaxation time T±. 

A general SES state is of the form 

n n 

w = 5>ii), Ew J = 1 - ( 5 ) 

i=l i=l 

which corresponds to a point on the sphere S 2 ™ -1 . For 
example, the states ([5]) include the maximally entangled 
W-type state 

lunif) = IP + |2) + - + I") 

|10---0) + |01---0)+--- + |00---l) 
= 7= • (6) 

E. SES Hamiltonian 

The advantage of working in the SES can be under- 
stood from the following expression for the SES matrix 
elements of model $2$, 

Hu> = (i|ffFQc|*') = ei&w +9w- (7) 

Because the diagonal and off-diagonal elements are di- 
rectly and independently controlled by the qubit ener- 
gies and coupling strengths, respectively, we have a high 
degree of control over the SES component of the quan- 
tum computer's Hamiltonian. This property allows many 
n-dimensional unitary operations to be carried out in a 
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single step, bypassing the need to decompose into ele- 
mentary gates, and also enables the direct quantum sim- 
ulation of real but otherwise arbitrary time-dependent 
Hamiltonians (a preliminary account of this application 
is given in Ref. |49j). 

There are two reasons why the SES Hamiltonian is 
not completely controllable: 

1. Ha' is real, whereas the most general Hamiltonian 
is complex Hermitian. The experimentally avail- 
able control parameters, consisting of n qubit ener- 
gies and n(n— 1)/2 coupling strengths, are sufficient 
to control n(n+ 1)/2 independent paramaters of an 
nxn symmetric matrix that is real. 

2. There are experimental limitations on the range of 
values that the ej and g^i can take. We define g max 
to be the magnitude of the largest coupling avail- 
able in a particular experimental realization (a re- 
alistic value is 100 MHz). 

We will leave the discussion of possible generalizations to 
complex Hamiltonians for future work. 



III. APPLICATIONS 

A. Entangled state preparation 

It is simple, for example, to generate the entangled 
state © in a single step: Consider the nxn real Hamil- 
tonian 



n = g 
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(8) 



where qubit 1 is symmetrically coupled to all other 
qubits, which are themselves uncoupled (a star network). 
Only two of the eigenfunctions — call them \ip±) — have 
overlap with |1), so the evolution of the |1) state is effec- 
tively a two-channel problem. The spectrum is as follows: 
States \ip±) have energy e± = g(l± \/n); all other eigen- 
functions are degenerate with energy e = 0. Evolution 
for half a period corresponding to the splitting 2^/ng, 
namely 



to 



2y/ng' 

leads to the desired operation 



lif) = exp 



(9) 
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(10) 



This can be implemented in a few ns with superconduct- 
ing circuits. 



B. Grover's search algorithm 

Although we have argued that the SES method is prob- 
ably most useful for universal quantum simulation, it is 
also interesting to consider a traditional quantum algo- 
rithm. Here we apply the SES method to Grover's search 
150] for a single marked state \i) in a database of size n, 
which takes the form 



(WOi) K \um{) » |»), 

W= 2|unif)(unif| - I 
( 2 



Here 



i — 

with K = — v/n. 
4 



(11) 
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is the inversion operator, 
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(12) 



Oi 



1 



V 



(13) 



V 



is the oracle (a diagonal matrix with the ith element 
equal to —1 and the others equal to 1), and |unif) is 
the uniform superposition ^ . The operator (fT2"j) can be 
implemented in a single step using 



(14) 



which describes a completely symmetric, fully coupled 
array, for a time 
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(15) 



(16) 



7T 

^qu — ■ 

ng 

This leads to the desired operation 



W = exp 



up to a multiplicative phase factor. The oracle (fT3j) can 
be generated by a 2ir rotation on qubit i. Each itera- 
tion of the search can be implemented in a few ns, al- 
lowing SES processors of even modest sizes to perform 
computations that would otherwise require thousands of 
elementary gates. 
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C. Time-independent Schrodinger equation solver 

Finally, we consider wave function propagation by 
a real but otherwise arbitrary time-independent n x n 
Hamiltonian H, 



-iHt 



(17) 



Here t is the length of simulated time (for example, the 
duration of some physical process to be simulated). To 
map this to the SES processor we first find the smallest 
positive constant A such that every matrix element of 



H — const x I 
A 



(18) 



is beween — g max and g max . With the SES processor we 
then simulate the equivalent evolution 



where 



^-^qc ^qc 



At. 



(19) 



(20) 



The time t qu required to perform a single run of the quan- 
tum computation is therefore 



in 



^qc ~l~ tn 



(21) 



where i mca s is the (projective) qubit measurement time, 
which for superconducting qubits we can assume to be 
about 100 ns 

How long does the corresponding classical simulation 
take? This depends on the particular Hamiltonian, 
through the value of A. However, it is possible to as- 
sess the possibility of quantum speedup by finding the 
time required to classically simulate a typical run of the 
SES processor, where H qc is an nxn random symmetric 
matrix with elements bounded by g max . The optimal al- 
gorithm used to perform this classical simulation depends 
on the value of i qc : When t qc is less than a parameter t* 
that depends on n, the fastest way to classically simulate 
(|19[) is to solve the Schrodinger equation 



<9i|-0)qc = -iffqclVO 



qc 



(22) 



as a set of n coupled ordinary differential equations. In 
this regime the classical simulation time scales linearly 
with t qc . When i qc > t* the fastest way to simulate (JT 



is by matrix exponentiation or diagonalization, and the 
classical simulation time is independent of t qc . 
Next we specialize to the case 

n = 1000 and = 100 MHz. (23) 

In this case it can be shown that (with a single core) 

t* « 1 ns, (24) 

and in what follows we consider an evolution with t qc 
longer than t* . The result (j2~4")l is only weakly dependent 



on the particular algorithms used to integrate the ordi- 
nary differential equations and diagonalize the Hamilto- 
nian. Although a precise value of t* is not needed here, 
it is important that it is sufficiently less than the coher- 
ence time of the quantum computer. The classical (single 
core) simulation time is then equal to the matrix expo- 
nentiation or diagonalization time, which is about Is for 
a 1000 x 1000 real, symmetric matrix. Thus we would ob- 
tain speedup relative to a single core if we can perform 
a 1000 qubit simulation, and read out the result, in less 
than about Is. 

Our objective is to achieve speedup relative to a state- 
of-the-art supercomputer, not a single core. The classi- 
cal simulation time (matrix diagonalization time) then 
needs to be evaluated on a supercomputer, using an op- 
timally distributed parallel algorithm. However, we can 
bound the parallel performance by using the single-core 
result and assuming perfect parallelization efficiency. If 
we approximate a petaflop supercomputer by 10 6 gigaflop 
cores, we conclude that the classical calculation time can 
be no shorter than 10~ 6 times the single-core time. For 
the n = 1000 case considered above the classical sim- 
ulation is therefore no shorter than t c \ — 1 /is, while 
the quantum simulation (including readout) can be per- 
formed in a few hundred ns. Thus we would achieve a 
genuine quantum speedup for Schrodinger evolution with 
a 1000-qubit SES simulator. 



IV. CONCLUSIONS 

It is tempting to conclude that by working in an expo- 
nentially small (approximately invariant) subspace of the 
full n-qubit Hilbert space, any computational power that 
quantum physics has over classical is sacrificed, but this 
is not the case. Any BQP computation necessarily ac- 
cesses only exponentially small subspaces (polynom ially 
larger than the SES, but still exponentially small) [52j ]. 
This is because a quantum computer starts in a prod- 
uct state and then implements a polynomial number of 
two-qubit entangling gates. 

It is also interesting to compare the space resource re- 
quirement (number of physical qubits) of the SES method 
relative to a fault-tolerant quantum computer. Because 
approximately 10 4 physical qubits are required per ideal 
qubit in the surface code (this includes 10 logical qubits 
per ideal for magic state distillation), the logical Hilbert 
space simulated has dimension 



>10 _4 n 



(25) 



which is smaller than n when n < 1.7 x 10 5 . This shows 
that for a superconducting quantum computer with fewer 
than about a hundred thousand physical qubits, an 
SES processor will simulate a higher dimensional Hilbert 
space than the surface code will, although the fault- 
tolerant computer will of course run for a much longer 
time. 
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We have argued here that the 1000-qubit SES quan- 
tum simulator should be capable of speedup relative to 
a petaflop supercomputer. However, there are important 
limitations of the SES method: 

1. We require a fully connected network with tunable 
coupling between every qubit, which therefore has 
n(n — l)/2 coupling circuits and wires. This may 
make the SES approach unscalable beyond a few 
thousand qubits. 

2. We require an exponentially large number of phys- 
ical qubits relative to the standard model. How- 
ever, because of the large physical qubit overhead 
required by error correction, the SES method will 
be simulating a larger Hilbert space until there are 
about a hundred thousand physical qubits. 

3. The maximum computation time is limited by the 
coherence time of the hardware and calculations 
cannot run indefinitely. This applies to all known 
pre-threshold approaches, including analog quan- 
tum simulators. 

4. Computations have fixed accuracy and cannot be 
made arbitrarily accurate. This also applies to 
all known pre-threshold approaches, including ana- 
log simulators. Error sources include decoherence, 
leakage out of the SES, readout errors, as well as 
pulse control errors. 

5. In the present SES approach the simulated Hamil- 
tonian must be real. 

6. In our estimates of classical simulation time, we 
have not included the time needed to store the 
Hamiltonian in memory or perhaps compute it from 
a separate procedure. Similarly, for the quantum 
simulation time estimates we have not included the 
time required to send the n(n + l)/2 voltage signals 
to the qubits and couplers before the simulation, 
and to reset the readout circuitry afterwards. 

7. It is important to emphasize that the SES method 
implements a weak simulation (providing a single 
sample from the distribution of possible measure- 
ment outcomes), not the probability distributions 
themselves as is normally provided classically. This 
is of course also a limitation of the conventional 
approaches designed for error-corrected quantum 
computers, and is usually a limitation of analog 
quantum simulation as well. 

8. We have argued here that a quantum speedup is 
possible with 1000 qubits, but not every 1000-qubit 
simulation will possess this speedup (this depends 
on the particular simulated Hamiltonian, the value 
of A, and the desired accuracy). 

9. The SES method is unsuitable for applications (in- 
cluding Shor's) requiring enormous Hilbert spaces. 



We note, however, that in contrast to efficient gate-based 
simulation intended for a scalable quantum computer 
S Hi > the SES approach is not restricted to local (or 
sparse) Hamiltonians. 

The quantum simulation application discussed here is 
restricted to time-independent Hamiltonians. The time- 
dependent case, discussed previously in Ref. [49j . is per- 
haps even more interesting. However, the estimates of 
both t c \ and t qu in the time-dependent case are some- 
what more subtle, and for the purpose of establishing 
speedup it is simpler to treat the time-independent case. 

The SES method appears to be distinct from previ- 
ously investigated approaches to quantum computation. 
Like analog quantum simulation, it is tied to a very 
specific hardware model and cannot be implemented on 
any architecture. However, it enables (almost) universal 
quantum computation and simulation, and might make 
practical a first-generation universal quantum simulator 
capable of achieving genuine quantum speedup. 
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Appendix A: General coupling types 

Here we discuss the generalization of the SES method 
to FQC models of the form 

H = e ^ + \ H 3w -v < ® <• . ( A1 ) 



where the (with fi = x,y, z) are Pauli matrices and 
J^ v is a fixed, real, dimensionless tensor determined by 
the hardware. In this case the SES matrix elements are 



\H\i') 



j 3<j' 



J xx ~l~ Jyy i'i.Jxy Jyx) 



(A2) 



Note that the term proportional to (Xw<j' 9jj')^W is an 
energy shift and can be dropped. The results given in 
the main body of the paper apply when 



J. X 



and 



J* 



Jyy ^ 



J y 



(A3) 



(A4) 



The condition (|A3[) means that the interaction has an 
exchange or transverse component, and (|A4p ensures that 
gu> is purely real. 
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Further generalizations are possible, including cases qubit terms, and cases where gw is complex, but these 
where the coupling introduces both single-qubit and two- will not be considered here. 
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